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The high frequency of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) 
mutations and homology of the Angiotensin-Converting Enzyme-2 (ACE2) cell receptors 
in various hosts help the virus transcend species barriers. In this study, we investigated the 
mutations of the SARS-CoV-2 spike glycoprotein detected in cats and their effect on its 
structure and function. Interestingly, some of these mutations are reported here in cats for 
the first time. Structural analysis showed seven residue substitutions in the spike 
glycoprotein. Four of the detected mutations are located on the spike surface, which 
are critical interaction points for neutralizing antibodies. Furthermore, three of the reported 
mutations could facilitate viral binding to the ACE2 host receptor, influence S1/S2 
cleavage, destabilize the B-hairpin structure of the S2 and enhance viral infectivity. 
Structural modeling and phylogenic analysis of the ACE2 receptor provided an 
indication of the binding capacity of the virus to the specific cell receptors of different 
species and hosts. The presented work highlights the effects of the residue substitutions 
on viral evasion, infectivity and possibility of SARS-CoV-2 spillover between humans and 
cats. In addition, the work paves the way for in-depth molecular investigation into the 
relationship between SARS-CoV-2 receptor binding and host susceptibility. 
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INTRODUCTION 


Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which triggered the Coronavirus 
Disease 2019 (COVID-19) outbreak, has caused enormous socioeconomic losses (Zhou et al., 
2020). This virus is a member of the Nidovirales order, the Coronaviridae family and the 
Coronavirinae subfamily in the Betacoronavirus genus. SARS-CoV-2 is a spherical pleomorphic 
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virus (50-140 nm in size). It is an enveloped virus with large, 20 
nm-long, club-shaped peplomers that form a crown appearance. 
It has a helical nucleocapsid with the largest positive sense RNA 
(29,903 bases) (Lu et al., 2020). There are two protein groups in 
SARS-CoV-2 (structural and non-structural). The structural 
proteins include the spike glycoprotein (S), the nucleocapsid 
(N), the membrane protein (M) and the envelope protein (E) 
(Naqvi et al., 2020); whereas the non-structural proteins include 
the protease (NSP3 and NSP5), the RNA-dependent RNA 
polymerase protein (NSP12) (Begum et al., 2020), helicase 
(NSP13), endoribonuclease (NSP15) and other NSPs (NSP1, 
NSP2, NSP4, NSP6, NSP7, NSP8, NSP9, NSP10, NSP11, 
NSP14 and NSP16) that associated with immune suppression, 
viral replication and transcription (Chan et al., 2020). 

The SARS CoV-2 spike glycoprotein initiates a viral infection 
through its attachment to angiotensin-converting enzyme 2 
(ACE2) cell receptors, which are distributed widely in different 
organs, including oro/nasopharyngeal mucosae, the lungs, 
gastrointestinal tract (GIT) mucosae, the brain, kidneys and 
the liver (Hamming et al., 2004). The spike glycoprotein is a 
trimeric fusion molecule (as illustrated in Figure 1), and its 
corresponding gene consists of 3,822 nucleotides (21,563- 
25,384) that encode 1,273 amino acids. The spike glycoprotein 


is composed of two main subunits, an amino (N)-terminal 
S1 subunit and a carboxyl (C)-terminal S2 subunit, that 
are cleaved at the furin cleavage site (S1/S2 cleavage region) 
(Chan et al., 2020). The detailed molecular structure of the spike 
gene with detailed regions and locations is illustrated in Figure 1 
and tabulated in Supplementary Table $1, according to the 
Wuhan strain (Reference Sequence NC_045512.2, GeneID: 
43740568) (Guruprasad, 2021; NCBI, 2021). The S1 subunit 
interacts with the ACE2 receptor subsequently promoting the 
viral infection (Hoffmann et al., 2020), and this interaction has 
an impact on both cross-species and human-to-human 
transmissions (Lu et al., 2015; Wan et al., 2020). The S1 
subunit switches between a standing up receptor-binding 
domain (RBD) (e.g., one or two RBD-up conformers), 
amenable to ACE2 binding, and closed-down positions, 
undergoing spontaneous conformational transitions between 
ensembles of closed and open receptor accessible forms (Wan 
et al., 2020; Wang et al., 2020). In contrast with the buried S2 
subunit, the S1 subunit domains are located on the surface of the 
spike glycoprotein, effectively protecting the fusion apparatus 
(Shang et al., 2020). Upon proteolytic activation at the S1/S2 
region, the S1 subunit is dissociated from the S2 (Shang et al., 
2020). Following the latter dissociation event, the S2 subunit 
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FIGURE 1 | The structure of the spike glycoprotein and its coding gene. (A) shows a schematic surface view of the spike ectodomain and its constituting domains and 
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regions. (B) illustrates the structure of the native spike glycoprotein in its pre-fusion and post-fusion states as determined by Cryo-EM (PDB ID 6xr8 and 6xra, respectively) (Cai 
et al., 2020) in which the spike trimer is composed of three interacting chains or protomers depicted here in distinct colors. (C) demonstrates the detailed structure of the spike 
gene with the locations of the different regions shown according to the Wuhan strain (Reference Sequence NC_045512.2, Gene ID: 43740568) (Guruprasad, 2021; NCBI, 
2021). The spike glycoprotein is composed of 2 subunits (S1 and S2). The S1 subunit includes the N-terminal domain (NTD), the receptor binding domain (RBD) which carries 
the receptor-binding motif (RBM), and two structurally conserved subdomains (the C- terminal domains 1 (CTD1) and the C- terminal domains 2 (CTD2). The S2 subunit 
contains the N-terminal hydrophobic fusion peptide (FP), the fusion peptide proximal region (FPPR), the heptad repeat 1 (HR1) motif, the central helix region (CH), the B-hairpin 
region, the connector domain (CD)and the heptad repeat 2 (HR2) motif. These are followed by the transmembrane region (TM) and the intracellular regions (IC) (Guruprasad, 
2021; NCBI, 2021). The figure was assembled with Biorender website (Biorender, 2021). 
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undergoes a series of enormous structural rearrangements in 
order to orchestrate the fusion of cellular and viral membranes 
(Duan et al., 2020; Shang et al., 2020; Wan et al., 2020; Wang 
et al., 2020). 

SARS-CoV-2 has evolved at a rate of approximately 1.5-3.3 x 
10 */per site/year (Chaw et al., 2020). Consequently, residue 
substitutions may affect viral infectivity, pathogenesis, 
transmission and viral virulence (Shafique et al., 2020). The 
high mutation rate and recombination frequency is a significant 
dilemma that may have a role in cross-species transmission 
(Elaswad et al., 2020). SARS-CoV-2, which has a bat precursor, 
evokes many questions regarding its natural origin, its 
transmissibility to different animal hosts and the possible 
incidence of animal viral reservoirs (Elaswad et al., 2020). 
Many experiments have investigated the susceptibility of 
different species to SARS-CoV-2 infection, including non- 
human primates, ferrets, cats, dogs, minks, tigers, mice and 
pigs (Hobbs and Reid, 2021). Furthermore, a case of zoonotic 
transmission from minks to farm workmen in Denmark was 
investigated (Prince et al., 2021). 

The cross-species transmission of coronaviruses and their 
ability to jump species barriers have been reported. Moreover, 
many research papers have investigated the factors that may 
strengthen the capability of coronaviruses to overcome species 
barriers and increase their virulence (Elaswad et al., 2020). These 
features are illustrated not only by the high mutation rates of 
coronaviruses (Su et al., 2016), but also by the large RNA genome 
of these viruses that allows the occurrence of recombination 
and mutation, resulting in the emergence of new coronaviruses 


(Ye et al., 2020). Despite the importance of these features, the 
interaction ability of coronaviruses with different ACE2 
receptors has become the dominant and most important 
virulence factor of these viruses (Bolles et al., 2011). According 
to the fifth report of the Office International des Epizooties (OIE) 
regarding the SARS-CoV-2 situation up until the end of 
September 2021, 584 outbreaks in animals (mostly in minks, 
cats and dogs) were reported globally, affecting 12 species in 30 
countries (as demonstrated in Figure 2) (OIE, 2021). Therefore, 
the OIE strongly encourages the reporting of SARS-CoV-2 
occurrence in animals (OIE, 2021). 

Furthermore, in our recent study, we reported SARS-CoV-2 
spillover between humans and pet animals in Egypt in 30.3% of 
cats (10/33) and 24.2% of dogs (8/33), as evidenced by real-time 
reverse transcriptase polymerase chain reaction (rRT-PCR) 
(Mervat et al., 2021). Moreover, thrombocytopenia, 
lymphocytopenia and high levels of ferritin, C-reactive protein 
and D-dimers were reported in hematology and clinical studies 
(Mervat et al., 2021). 

In order to further investigate such interspecies transmission, 
in this study, we conducted a full spike gene sequencing of SARS- 
CoV-2 detected in four cats, and their treating veterinarian. 
Samples selection was based on the severity of clinical signs. 
Sequencing was combined with computational analysis, 
including structural visualization and homology modeling of 
the glycoprotein to investigate the mutations and their effect on 
its structure and function. Structural modelling of the ACE2 
receptor in different species was conducted to study the host 
susceptibility to SARS-CoV-2 infection. 
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FIGURE 2 | COVID-19 outbreaks in animals. The figure illustrates the number of outbreaks reported worldwide by species according to the fifth OIE report (till 30 
September 2021) (OIE, 2021). The figure was assembled with Biorender website (Biorender, 2021). 
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MATERIALS AND METHODS 
Ethical Approval 


The study was conducted according to ethical guidelines and 
approved by the Ethics Committee of Animal Health Research 
Institute (protocol code 11429 and date of approval 12/2020). 


Sample Collection and Preparation 

Swabs from oro/nasopharyngeal mucosa were collected from 
four cats, (three of these cats had severe respiratory signs and one 
of them were a symptomatic), that were in contact with COVID- 
19 positive owners. Additionally, a sample was taken from a 
veterinarian who had been in contact with these cats in his clinic 
and he was asymptomatic. These samples were collected in 
December 2020, March 2021 and July 2021. The swabs were 
deposited into sterile tubes containing 2 ml of autoclaved 
phosphate buffered saline (PBS). The latter samples were 
collected independently from different houses and represent 
independent outbreaks. The possibility of SARS-CoV-2 
spillover was based on infection’s chronology, symptoms onset 
and case history, as owners were infected with SARS-CoV-2 
before symptoms appear on their animals, and hence they likely 
have transmitted the virus to their companion animals. 


Nucleic Acid Extraction and 

Molecular Diagnosis 

According to the manufacturer’s instructions, viral ribonucleic 
acid (RNA) was extracted automatically using a MagPurix 
extractor (Labgene scientific, Switzerland). This was followed 
by a SARS-CoV-2 screening of the extracted RNA with rRT-PCR 
using a transgenbiotech multiplex SARS-CoV-2 diagnostic kit 
(Cat no. DV101) that detects two genes (nucleocapsid and 
ORFlab genes). The cycling protocol of the rRT-PCR was 
composed of 10 minutes at 50°C and 30 seconds at 95°C, 
followed by 40 cycles of 5 seconds at 95°C then 30 seconds at 
60°C. 


Full-Length Spike Gene Amplification 

The full-length spike gene of the spike glycoprotein was 
amplified for the rRT-PCR positive samples using conventional 
RT-PCR utilizing spike gene-specific primers, as reported 
previously (Ren et al., 2020) and tabulated in (Supplementary 
Table S2). Briefly, the RT-PCR reaction was set up using an RT- 
PCR kit (PrimeScript, Takara, California, USA). The cycling 
conditions were as follows: a cycle of 30 minutes at 50°C for the 
activation of reverse transcriptase (RT); then 2 minutes at 95°C 
for the deactivation of the RT; followed by 40 cycles of 95°C for 
30 seconds, 50°C for 30 seconds and 72°C for 1.5 minutes; with a 
final extension cycle of 72°C for 10 minutes. 


Sequencing and Phylogenetics 

According to the manufacturer’s recommendations, positive 
DNA bands with the expected sizes were cut and purified 
using a gel extraction kit (QIAquick, Qiagen). The BigDye 
Terminator 3.1 Cycle Sequencing Kit was used to sequence the 
purified products (Applied Biosystems, USA). Next, the Centri- 
Sep Purification Kit (Applied Biosystem) was used to purify the 


sequencing reactions. The purified sequence products were then 
decoded using the 3500 Genetic Analyzer (Applied Biosystems). 
The BioEdit Sequence Alignment Editor 7.2.5 (Hall, 1999) was 
used to prepare protein multiple sequence alignments. MEGA7 
software was used to create a midpoint rooted maximum 
likelihood phylogenetic tree with 1000 bootstrap replicates to 
verify the tree (Kumar et al., 2016). This tree depends on the 
Jones-Taylor-Thornton (JIT) matrix-based model (Jones et al., 
1992) and the tree is illustrated with the highest likelihood log 
value (-11894.02) in which related taxa and closely related 
branches are clustered together. The initial tree for the 
heuristic search was automatically generated with application 
of the neighbor join and BioNJ algorithms to a matrix of pairwise 
distances predicted using the JIT model, then the topology with 
the highest log likelihood value was chosen. The branch lengths 
were calculated by the number of residue substitutions per site, 
and the tree is depicted to scale. MEGA7 software was used to 
display the evolutionary analysis (Kumar et al., 2016). 


Structural Visualization and Obtaining a 
Homology Model of the Spike Mutant 
Cartoon illustrations were generated using UCSF Chimera 
molecular visualization software (Pettersen et al.,2004). Atomic 
coordinates from PDB (ID 6xr8 and 6xra (Cai et al., 2020) were 
used to represent the native structures for the pre-fusion and 
post-fusion (S2) spike glycoprotein ectodomain trimer, 
respectively; whereas atomic coordinates from PDB ID 7krq and 
7krr (Zhang J et al., 2021) were used to represent the (D614G) 
closed and one RBD-up open conformers, respectively. The 
homology model of the spike glycoprotein chains (i.e., trimer) 
for the sequence presented in this study (hCoV-19/Cat-Kitten/ 
Egypt/AHRI/July/2021) was generated using the MODELLER 
software (Eswar et al., 2003), with the (D614G) model (PDB ID 
7krr, which is in the open state with one RBD-up) as a template. 
The sequences were aligned using the MUSCLE program (Edgar, 
2004), and the columns corresponding to missing residues in the 
template model were removed before submission to the 
MODELLER program. 


Homology Modeling of the ACE2 
Receptors in Various Species and Their 
Phylogenetic Analysis 

To be able to compare the receptors from various animal 
species, a BLASTP (Altschul et al., 1990) search was performed 
using the reference sequence for human ACE2 receptor against the 
NCBI Protein Reference Sequence database (Refseq_protein). Over 
1200 significant hits were found. Sequences of the best 249 hits (ie., 
those with the highest sequence identity) were retrieved and aligned 
using the COBALT multiple sequence alignment tool 
(Papadopoulos and Agarwala, 2007) through the NCBI 
server. Throughout this paper, all residue numbers for the 
receptor refer to the human ACE2 isoform 1 protein reference 
sequence (accession: NP_001358344.1). The alignment blocks of 
interest were extracted using the extractalign tool in the EMBOSS 
package (Rice et al., 2000). The alignment blocks for the main 
residues (or regions) that were in contact with the viral RBD in the 
structure of the RBD-ACE2 complex were used to generate 
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sequence logos using the WebLogo Server (Schneider and Stephens, 
1990; Crooks et al., 2004), namely alignment blocks incorporating 
residues 30-41, residues 82-84 and residues 353-357 were used. 
The residues coding for the signal peptide (residues 1-17) were 
removed from the alignment. The alignment blocks of residues 19- 
90 and of 324-393 were joint, and were then used for the 
construction of the ACE2 phylogenetic tree. 

To obtain further structural insights into the differential binding 
capacity of the various host species receptors with the unmutated 
viral RBD, we built homology models of the receptor-RBD 
complexes of five distinct species. Specifically, models from cat, 
dog, ermine, ferret and chicken ACE2 receptor sequences were built. 
Homology models were built (also using MODELLER software) for 
the peptidase domains (PD domain, residues 18-614) of the ACE2 
sequences of each species with a fixed COVID-19 RBD domain. For 
simplicity, glycosylation sites, which are far from the receptor- 
binding interface, as well as bound water molecules in the template 
structures were ignored. However, the Zinc atoms were included in 
the built homology models as they may be essential for stabilizing 
the structures. The receptor-RBD complexes (PDB IDs 6LZG and 
6M17) were used as templates. A model for the human ACE2 
isoform 1 protein reference sequence was also built and used as a 
control. For each species-specific receptor-RBD complex, the model 
with the best score using MODELLER was used in the 
subsequent steps. Hydrogen atoms were added to each model 
assuming a neutral pH = 7.0, and the structure was then 
minimized using all atom Amber14 force fields (“amber14-all”) 
(Maier et al., 2015) in vacuum to remove any clashes. The energy 
minimization was performed using OpenMM Python API 
(Eastman et al). Isosurface representations and electrostatic 
(Coulombic) potential coloring of the isosurfaces were performed 
using Chimera (Pettersen et al., 2004). 

Next, a phylogenetic tree was constructed for all residues at 
the binding interface between the ACE2 and the viral RBD (19- 
90 and 324-393) in different species. MEGA7 software was used 
to create a midpoint rooted maximum likelihood tree with 1000 
bootstrap replicates to verify this tree (Kumar et al., 2016). A 
maximum likelihood technique depending on the JIT matrix- 
based model was used (Jones et al., 1992) with the highest 
likelihood log value (-1606.65). The related taxa and closed 
branches were clustered together. The initial tree for the heuristic 
search was automatically generated with the application of the 
neighbor join and BioNJ algorithms to a matrix of pairwise 
distances predicted using the JIT model, then the topology with 
the highest log likelihood value was chosen. The branch lengths 
were calculated by the number of residue substitutions per site, 
and the tree was depicted to scale. MEGA7 software was used to 
display the evolutionary analysis (Kumar et al., 2016). 


RESULTS 
Real-Time RT-PCR (rRT-PCR) 


Based on rRT-PCR molecular diagnosis, the four cat samples and 
the veterinarian sample were SARS-CoV-2 positive (as illustrated 
in Supplementary Table $3) with cycle threshold (CT) values of 
nucleocapsid (N) and ORFlab genes. 


Sequencing and Phylogenetics 

All obtained sequences of the five samples were uploaded to the 
National Centre for Biotechnology Information (NCBI) with 
accession numbers OK144251 to OK144255. Full spike gene 
sequencing and phylogenetic analysis revealed that sample 
hCoV-19/Cat/Egypt/AHRI/December/2020(OK144251) 
belonged to clade 20A and illustrated high identity with NC 
045512.2 L/Wuhan-Hu-1 (reference strain) and 
Egypt_NRC1_2020| EPI_ISL1315064|2020-04-26 (99.9%- 
100%). hCoV-19/Cat/Egypt/AHRI/March/2021(OK144252) 
was quite identical to Egypt/PHARCO-ARMY| 
EPI_ISL1936365/03-2021 with high identity (99.9%), and both 
of them are related to clade 20C. On the other hand, hCoV-19/ 
Cat-Mother/Egypt/AHRI/July/2021(OK144253), hCoV-19/Cat- 
Kitten/Egypt/AHRI/July/2021(OK144254) and hCoV 19/ 
Veterinarian/Egypt/AHRI/July/2021(OK144255) were similar 
to a human sample collected in the same month (hCoV-19/ 
Egypt/CPHL-$25/2021| EPI_ISL3274157|2021-07-08) with high 
identity (99.7%), and they belonged to clade 21A (Delta). The 
latter samples had some similarity with MZ266636.1 SARS-CoV- 
2/human/JORDON/AM-HU-16/2021 with high identity 
(99.5%), as presented in Figure 3 and Supplementary Table S4. 


Amino Acid Substitution 

In this study, seven critical amino acid substitutions were 
observed in spike glycoproteins, all of them were non- 
synonymous, namely (W152R; S:T454A) in the N-terminal 
domain of the S1 subunit, (L452R; S:T1355G) in the receptor- 
binding motif (illustrated in Supplementary Figures S2B, C), 
(A570V; S:C1709T) in C-terminal domain 1 (CTD1), (D614G; S: 
A1841G) which is found downstream of the receptor-binding 
domain in C-terminal domains 2 (CTD2), (Q677H; S:G2031T) 
in the S1/S2 cleavage site, (A899S; S:G2695T) in the fusion- 
peptide proximal region (FPPR) and (S1051Y; S:C3152A) in the 
B-hairpin region of subunit 2 (S2). Interestingly, some of these 
mutations have not been previously detected in cat samples in 
Hong Kong, France, Italy or the USA. Hence, this is the first 
report of these mutations in cats. The latter mutations were 
(W152R), (A570V), (Q677H), (A899S) and (S1051Y), as shown 
in Table 1. 


The Overall Distribution of Mutations and 
Impact on the Structure and Stability of 
the Spike Protein 

The overall distribution of the mutations mapped onto the 
predicted structure (homology model) of the studied sequence 
(Figure 4) showed that four residue substitutions were located 
on the viral surface (Figure 4A), which were (W152R), (L452R), 
(A570V) and (A899S). Furthermore, the (D614G) mutation, 
which is found downstream of the receptor-binding domain 
(RBD), leads to a loop misplacement that results in structural 
rearrangements (Zhang J et al., 2021). 

More specifically, in the closed trimer state, the D614 
structure was found to have disordered loops between residues 
620 and 640 (designated as 630 loops) within its monomers. 
Upon mutation, the G614 showed ordered loops instead, as a 
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77 | USAIIL-CDC-ASC210047375/0K296513.1:21555-25376/2021 (Kappa) 
—i USA/CA-CDC-STM-000042578/ OK 194572. 1:21533-25354/2021 (Kappa) 
IndiafhCoV-1 9/DL/602 1/6054/3665668|202 1-04(Kappa). 
SouthAmerica/hCoV-19/2021/3944425|2021-08(Mu). 
t— USAJFL-CDC-ASC210032622/OK296824.1:21561-25382/20211 (Mu) 
* TV UsaiuT-coc-ASC210107855/0K296185.1 21533-25354/2021 (Mu) 
A hCoV-19/Egypt/CPHL/9047444|2021-12- 
hCoV-19/Bo 


Dog/USAVFL-21-002342-001|2958983|2021-01-15.(lota) 
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USA/OR-CDC-ASC210072030/0K291415.1:21550-2537 1/2021 (lota) 
USA/OR-CDC-QDX24476241/0K251054.1:21500-25321/2021 (Iota) 

USAVIL-CDC-LC0072392/ 0K233736.1:21244-25065/2021 (Beta) 

KEN/K100818/0K449138.1:21509-25330/2020(Beta) 

South Africa/R20090-20/940143(Beta). 

Brazil/SP-IB 101399/2021|1967314|(Gamma). 

USA/2105200446/MZ799138.1:21516-25337/2021(Gamma) 
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99 | @ Cat/BraziRU-INCA-Vett76]6470123|2021-02-01.(Gamma) 
© Cat/Brazil/BA-UFOB|4565991|2021-05-01.(Gamma) 

USA/MD-IGS-0121 10204642A/2021|1967601 (Eta). 

— SLE/SL34/MZ854393.1:21526-25341/2021(Eta) 

GHA/nmimr/TRA-190/MW598416. 1:21554-25369/2021 (Eta) 

© Cat/Argentina/PAIS-C0164/4875726|2021-05-10.(Alpha) 
|@ CatFrance/un-UCN-JockerR}3857106|2021-03-22.(Alpha) 
SS) a. hCoV-19/E gyp/ARMY-404/1969081|2021-04-24(Alpha) 
A. hCoV-19/EgypEMC-1/2960150|2021-04-21(Alpha) 
us A. hCoV-19/E gypt/CCHES7357/2566505|2021-03-07(Alpha) 
England/LOND-128A79B/2021|1391414|(Alpha). 
Dog/Switzerland/TI-USZ 14|4549293|2021-06-03.(Alpha) 
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@ hCov-19/Cat/Egypt/AHRI/March/2021(OK144252) 
‘A Egypt/PHARCO-ARMY-78/03-2021 
‘A. EgypMASRI-C4-029/2020]1165078|2020-06-28 
Dog/lUSATX-TAMU-077 2020|699508|2020-07-28(20C) 
@ Cat/USATX-TAMU-078 2020|699509|2020-07-29(20C) 
@ CalUSA/UT-CDC-4159739-001|9936852|2020-08-26,(20C) 
@ CatPeru/sc2cat1|2791488|2021-03-26 (Lambda) 

99 | @ Cat/Perulsc2 cat3|2791490|2021-05-02.(Lambda) 
USA/FL-CDC-ASC210162736/0K321046.1:21524-25324/2021 (Lambda) 
PeruhCoV-19/3431/2021/3940405|2021-07(Lambda) 

71 | @ CaVUSA/AZ-21-004025-001|2930561 [2021-01-29.(Epsilon) 
@ CaVUSAICA-21-003696-001|2958980|202 1-01-30.(Epsiton) 

98 | USA/CA-CDPH-2000023076/0K173427..1:21540-25361/2021 (Epsilon) 
USAWA-CDC-LC0033839/0K230872.1:21510-25331/2021 (Epsilon) 
USA/CA-COLOR-UCSF-CC198/2021|1824917|(Epsilon). 

Dog/Hong Kongi20-02756/2020|4 145 18|2020-02-26(20B) 
—| Mink Netherlands NB-EMC-21-52020|5230711|2020-07-09 
© CatHong kong 20-04236/2020|759858}2020-03-30(208) 
@ hCoV-19/Cat-Kitten/EgypvAHRIduly/2021 (OK 144254) 
}@ hCoV-19/Veterinarian/EgypvAHRI/July/2021(OK144255) 
@ hCoV-19/Cat-Mother/EgypUAHRI/July/2021(OK144253) 
‘A hCoV-19/EgyptiCPHL-S25/2021|3274157|2021-07-08 
MZ266636.1 SARS-CoV-2/human/JORDON/AM-HU-16/2021 
© Cat/Switzerland/ZH-UZH-P1841|11583333|2022-01-08.(Delta) 
95 | A hCoV-19/EgyptICPHL-S26/2021 |36007211|2021-07-15(Delta) 
© hCoV-19/cat/Belgium|3128536|2021-07(Delta) 
India/nCoV-19/3665669|202 1-05(Delta). 

F | @ CavUSA/OH-21-037782-001]8145732|2021-12-10.(Delta) 
Dog/USA/VA-21-032400-003/2021|EPI ISL 6088079|2021-10-22.(Delta) 
Dog/USA/WA-DDL21-10981-4|7845317|2021-07-15.(Delta) 

@ CatUSAICA-21-035224-002/7974436|2021-11-15.(Delta) 
A EgyptNRC1 2020|1315064|2020-04-26 
@ hCoV-19/Cat/EgypvAHRIDecember/2020(0K144251) Clade 208 
Mink Netherlands 1 2020|431778|2020-04-24 
© Catitaly VE-IZSVe-20D1A30040-22020/962892|2020-11-27(20A) 
Dogiltaly 399-208 2020|730652|2020-11-04(20A) 
© CatFrance IDF-532020/437349|2020-04-17(20A) 
LM NC045512.2L/Wuhan-Hu-t (Reference) Clade 19 A 


0.0010 


SARS-CoV-2 Wuhan-Hu-1 human reference strain. 


|n 2aH(Mu) 


hCoV-19/deer/USA/NY/9428788|2022-01-06(Omicron) 


Clade 21F(Tota) 


J 20H(Beta) 


[Clade 201(Alpha) 


Clade 21G(Lambda) 


[Clade 21C(Epsilon) 


[Clade 213(Delta) 


FIGURE 3 | Phylogentic analysis of the SARS-CoV-2 sequences. The rooted maximum likelihood tree shows the relationship between sequenced samples and 
current circulating variants in humans and animals in which the related taxa and closed branches are clustered together illustrating SARS-CoV-2 evolution. The blue 
circles illustrate the sequenced SARS-CoV-2 that were detected in cats in the present study while the red circles display the detected SARS-CoV-2 in cats 
worldwide from 2020 till January 2022. The green triangles depict circulating SARS-CoV-2 strains in Egypt within the same time frame. The red square refers to 


Clade 218(Kappa) 


14(Omicton) 
tswana/R74B95/9342666|2022-01-24 (Omicron). 


Clade 21K(Omicron) 
hCoV-19/syrian hamster/India/8542942|2021-11 (Omicron) 


Dog/USAVGA-22|10656102|2022-01-01.(Omicron) 
99 | @ Cat/Switzerland/SG-UZH|1 15833 19|2022-01-06.(Omicron) 


Clade 203(Gamma) 


Clade 21A(Delta) 


consequence of the elimination of a salt bridge between D614 of 
CTD2 and K854 in the FPPR. It was concluded that such a 
mutation leads to the stabilization of the closed trimer state. On 
the contrary, the one RBD-up state requires at least one of the 
loops to be disordered and the second one to be partially 


disordered. The latter state precedes the formation of two and 
three RNA-up states, which are the precursors of the S2 post- 
cleavage state. The (D614G) mutation, therefore, slows down the 
formation of one RBD-up state, as well as the shedding of the S1 
before the post-cleavage state formation. The accumulation of 
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TABLE 1 | Amino acid substitutions in the sequenced samples in comparision to some circulating variants in human and cats. 


No Region/Amino acid Position 
NTD RBM CTD1 CTD2 $1/S2 FPPR B-hairpin 
W152 L452 A570 D614 Q677 A899 $1051 

0 NC 045512.2 L/Wuhan-Hu-1 (Reference) Ww L A D Q A Ss 

1 Cat Hong Kong 20-04236/2020|759858|2020-03-30 G 

2 Cat_France_IDF-53_2020|437349|2020-04-17 G 

3 Cat_France_Env-Ba_2020|483063 G 

4 Cat_Italy_VE-IZSVe-20DIA300402-2020|962892|2020-1 1-27 G 

5 Cat_USA_TX-TAMU-078_2020|699509|2020-07-29 F G 

6 hCoV-19/cat/Belgium|3128536|2021 -07(Delta)(4.1) R G 

7 hCoV-19/Cat/Egypt/AHRI/December/2020 (OK144251)* G 

8 Egypt_NRC1_2020] EPI_ISL1315064|2020-04-26 ‘ G : 

9 hCoV-19/Cat/Egypt/AHRI/March/2021 (OK144252)* R G H 

10 Egypt/PHARCO-ARMY|EPI_ISL1936365/03-2021 : R ‘ G H ; ‘ 

11 hCoV-19/Cat-Mother/Egypt/AHRI/July/2021 (0K144253)* R R Vv G H S) ve 

12 hCoV-19/Cat-Kitten/Egypt/AHRI/July/2021 (OK144254)* R R Vv G H S) ng 

13 hCoV-19/Veterinarian/Egypt/AHRI/July/2021 (OK144255)* R R Vv G H Ss Y 

14 hCoV-19/Egypt/CPHL-S25/2021| EPI_ISL3274157|2021 -07-08 R R G H S) 

15 MZ266636.1 SARS-CoV-2/human/JORDON/AM-HU-16/2021 R R G H Ss 


*The sequenced samples in this study. 


the one RBD-up state, but not the other RBD-up states, is 
thought to explain the increased infectivity, but at the same 
time, the reduced overall binding affinity with the receptor. As a 
consequence, the formation propensity of a post-cleavage state is 
decreased in viral strains with a (D614G) mutation (Zhang J 
et al., 2021). 

The (A570V) and the (S1051Y) mutations are of particular 
interest as they are located at the interface of different chains and 
domains. Supplementary Figure S1 illustrates that the residue 
570 is surrounded by several other hydrophobic residues in both 
native and predicted structures. The increase in hydrophobicity 
of V570 compared with A570 suggests that this mutation is likely 
to contribute to the stabilization of this residue in place. 

The impact of (S1051Y), which is located in the B-hairpin region 
of subunit 2 (S2), is illustrated in Figures 4C-E. The pre-cleavage 
structures of the native sequence, in both the closed and open 
conformations, show that the residue $1051 is in contact with two 
histidine residues, namely H1048 and H1064, as illustrated in 
Figures 4C-E. In the closed pre-cleavage structure, the distances 
between the hydroxyl oxygen of the serine $1051 to the nearest 
nitrogen atoms on the rings of H1048 and H1064 are 2.71 and 2.77 
A, respectively. One of the two histidines, H1048, is on the same side 
of the B-sheet strand as $1051, whereas the other (H1064) is on the 
other strand of the B-sheet hairpin. The serine residue is therefore 
bridging the interaction between the two histidines, effectively 
stabilizing the hairpin and the overall structure of the B-sheet. In 
contrast, the post-cleavage structure shows that one of the two 
histidines (H1064) moves away from the serine residue, whereas 
1048 remains in close proximity. The distances between the serine 
oxygen and the closest ring nitrogen atoms of the histidines are 2.82 
and 4.4 A for H1048 and H1064, respectively. This is due to a shift 
that is seen of the relative orientation of the two strands of the B- 
hairpin upon the unfolding of the helices in the post-cleavage 
conformation. Such large conformational changes increase the 
mechanical stress on the B-hairpin structure as well as on the 
nearby disulphide bridge. The predicted model of the (S1051Y) 


mutation shows that the histidine interactions are diminished in the 
pre-cleaved conformer, which may contribute to the destabilization 
of the hairpin and the overall stability of the B-sheet. On the other 
hand, these hydrogen bonds break may contribute to lowering the 
energetic barrier to the large conformational changes required for 
the post-cleavage structure to be formed. 


Structural Modeling of the ACE2 Receptor 

in Various Species and Their Phylogenetic 

Analysis Deciphers Host Susceptibility 

As the mutations that were found in the cat samples have been 
previously reported in humans, there must be other factors that 
determine the host range of the virus, including viral entry into 
the host cell. To further understand the factors that control host 
range and host susceptibility, we compared the sequences of the 
host receptor, ACE2 (angiotensin-converting enzyme 2), in 
different hosts. The ACE2 receptor is a highly conserved 
protein that performs a specific enzymatic function in 
mammals. We performed an initial alignment of the ACE2 
protein isoform 1 from humans with that of susceptible 
companion animal hosts, (namely, cats and dogs) and with a 
species that have not been reported to be infected with SARS- 
CoV-2 (ie. an unsusceptible control species). For the latter 
purpose, we used the ACE2 protein from chickens. 

In this initial comparison, we focused on the residues that 
have been shown to have direct contact with the virus RBD 
domain (e.g., from PDB IDs 6LZG and 6M17). Table 2 shows a 
comparison between the key residues that form such polar and 
hydrophobic contacts. One can ascertain that the number of 
residues substituted in such a way that alters the charge 
distribution of the binding interface in dogs (3 out of the 10 
residues are substituted) is more than that in cats (2 out of the 10 
residues are substituted), and that such substitutions are the most 
in chickens (7 out of the 10 residues are substituted). We also 
noticed that there seems to be no clear pattern or preference in 
these substitutions, but rather an overall change in the charge 
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FIGURE 4 | The overall distribution of the spike protein mutations observed in the studied sequences and the impact of (S1051Y) mutation on its structure and 
function. (A) shows the overall distribution of the mutations, found in the sequenced samples in this study, mapped onto the predicted structure by homology 


S1051Y 


modelling. The mutated residues in one monomer are depicted in red and the monomer with RBD-up is depicted in dark blue. The isosurface of the trimer is shown 
on the left and a rotated view (by 180o) of the latter is shown in the center, while the atomic details are shown on the right. (B) illustrates the predicted structure of 
the sequence presented in this study, depicted in light blue, superposed onto the template used to generate it (depicted in gold, derived from PDB ID 7krr). (C-E) 
illustrate molecular details of the impact of (S1051Y) on the structural stability and function of spike protein. In the presence of the native serine 1051 (PDB ID 7krr), 
the residue interacts with two histidine residues (His 1048 and His 1064), as illustrated in (C) (top view, showing hydrogen bonds depicted by the cyan lines) (Zhang 
J et al., 2021). Such interactions are likely to stabilize the beta-sheet within the native conformer. These interactions are likely to be abolished by the mutation 
(S1051Y), as shown in the predicted model by homology modelling (E). (D) shows the interaction of the serine and the two histidine residues in the unmutated post- 
cleavage structure [PDB ID 6xra (Cai et al., 2020)]. The figure illustrates that post-cleavage structural changes in the protein lead to bond break between the serine 


residue and His 1064. 


distribution was observed. More specifically, a polar to 
hydrophobic residue substitution Q24L is seen in cats and 
dogs. Whereas in chickens, a polar to a charged residue 
substitution Q24E is seen. Residues D31 in humans or E31 in 
cats and dogs, are substituted with A31 in chickens. The polar 
and positively charged residues Q32 and K31, respectively, which 
are conserved in humans, cats and dogs are substituted with 
negatively charged residues (K31E and Q32E) in chickens. The 
hydrophobic methionine residue M82, is substituted with a polar 
residue in cats and dogs, and even with a charged residue in 


chicken. Finally, the tyrosine residue Y83, which is conserved in 
human cats and dogs, is substituted with a more hydrophobic 
residue (phenylalanine) in chickens. Additionally, the alignment 
showed several residues on the surface of the receptor in close 
proximity to those that form direct contacts with the RBD, which 
undergo substitutions that also alter the binding interface charge 
distribution in unsusceptible species compared with susceptible 
ones. It is possible that these neighboring residues may affect the 
overall binding affinity of the viral RBD with the host receptor. 
For instance, some substitutions may introduce an overall 
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TABLE 2 | Substitutions in key residues involved in viral recognition on the ACE2 
surface in selected species. 


Human Cat Dog Chicken 
Q24 L L E 
D30 E E A 
K31 K K E 
H34 H Y V 
Y41 ¥ Y Y 
Q42 Q Q E 
M82 T ¥ R 
Y83 x Y F 
K353 K K K 
R357 R R R 


The table shows key residues involved in the binding of the human ACE2 with the viral 
RBD. The structure showed key residues that form a set of polar contacts (as well as one 
hydrophobic contact) with the viral RDB, hence stabilizing the complex. Substitutions that 
increase the hydrophobicity and polarity of the binding surface with respect to the human 
receptor are depicted in gray and orange fonts, respectively. Whereas substitutions that 
increase negative and positive charge are depicted in red and blue fonts, respectively. 


negative charge (e.g., the substitution of E35 in humans, cats and 
dogs, to R35 in chickens), increase hydrophobicity or even 
introduce stearic barriers. These observations suggest that the 
overall residue composition and associated charge distribution 
on the binding interface may be taken as an indicator (or at least 


a contributing factor) of the susceptibility of the host. The latter 
also suggests that the homology of the 10 sites on the ACE2 that 
make direct contact with the RBD, between humans and other 
hosts, may not alone be an efficient indicator of the host 
susceptibility, and a better indicator would be to compare all 
residues that are at or near the binding interface. 

To study this on a wider range of animal species, we 
performed a BLASTP search against the protein reference 
sequence database and retrieved sequences from different 
animal species with the highest sequence identity compared 
with the human ACE2 peptidase domain sequence (see methods 
for details). The multiple sequence alignment of the latter 
containing 249 sequences was used for further analysis. 
Indeed, sequence logo representations (Figure 5) of the 
alignment show that certain residues of those that form direct 
contact with the viral RBD, along with neighboring residues, are 
conserved among many animal species. More specifically, 
residues F32, N33, A36, E37, D355, F356, K353 and R357 
(marked with black and blue arrows in Figure 5) are highly 
conserved in the analysed sequences of the different animal 
species and are present at the receptor-RBD binding interface. 
Residues that are in direct contact with the viral RBD (mostly 
through their side chains) are marked with blue or red arrows. 
K353 and R357 are highly conserved and are in direct contact 


0 
N 


FIGURE 5 | Conservation of residues involved in direct contact between the receptor and the viral RBD. The figure shows sequence logo representation of 
alignments of residues 30-41, residues 82-84 and residues 353-357 of the host receptor (ACE2) from different species. The alignments involved 249 sequences that 
had the highest identity with the human ACE2 isoform 1 sequence. Highly conserved residues in this region are marked with black arrows. Conserved and non- 
conserved residues that are involved in direct contact with the viral RBD are marked with blue and red arrows, respectively. 
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99 


0.050 


NP 001034545.1 ACE 2-precursor Felis catus (Cat) 

XP 039089862.1 ACE 2-isoform X1 Hyaena hyaena (Hyaena) 

XP 029786256.1 ACE 2- Suricata suricatta (Meerkat) 

gg r~ NP 001158732.1 ACE 2-precursor Canis lupus familiaris (Domestic Dog) 

XP 025292925.2 ACE 2- Canis lupus dingo (Australian Dog) 

XP 032187677.1 ACE 2- Mustela erminea (Mink) 

99 _ NP 001297119.1 ACE 2-precursor Mustela putorius furo (Domestic Ferret) 

XP 006973269.1 ACE 2- Peromyscus maniculatus bairdii (Prairie Deer) 

NP 001129168.1 ACE 2-precursor Macaca mulatta (Rhesus monkey) 

NP 001358344.1 ACE 2-isoform 1 precursor Homo sapiens (Human) 

XP 018874749.1 ACE 2- Gorilla gorilla gorilla (Gorilla) 

XP 016798468.1 ACE 2-isoform X1 Pan troglodytes (Chimpanzee) 

XP 020768965.1 ACE 2- Odocoileus virginianus texanus (White-tailed Deer) 

XP 032963186,1 ACE 2- Rhinolophus ferrumequinum (Greater horseshoe bat) 
3! XP 043752042.1 ACE 2- Cervus elaphus (Red Deer) 

XP 024425698.1 ACE 2-isoform X1 Desmodus rotundus (Common vampire) 
XP 037013451.1 ACE 2- Artibeus jamaicensis (Jamaican fruit bat) 

XP 036768816.1 ACE 2- Manis pentadactyla (Chinese Pangolin) 

100! XP 017505752.1 ACE 2- Manis javanica (South Asia Pangolin) 

XP 040517014.1 ACE 2-isoform X1 Gallus gallus (Chicken) 

XP 009925641.1 ACE 2- Haliaeetus albicilla (White-tailed eagle) 

XP 009887331.1 ACE 2-isoform X1 Charadrius vociferus (Killdeer bird) 


FIGURE 6 | Phylogenetic analysis of the residues near the receptor-viral RBD interface. (A) shows the structure of the human ACE2-viral RBD complex (PDB ID 
6lzg), for both front and 180 degrees rotated views, with the residues at or near the receptor-viral RBD depicted in orange, whereas the remainder of the ACE2 
receptor PD domain and the viral RBD domain are colored in light grey. The alignment of latter residues (19-90 and 324-393) were used to construct a phylogenetic 
tree, shown in (B). The latter tree in panel B demonestrats a midpoint rooted maximum likelihood where the related taxa and closed branches are clustered together. 
The tree displays the evolutionary analysis of ACE2 receptor’s residues in contact with the viral RBD in different species and their relationship. 


with the viral RBD. Whereas, residues D30, K31, H34, Y41, M82 
and Y83 (marked with red arrows), along with two other 
residues (Q24 and Q42), are also directly involved in the 
binding, but show moderate to high variations among the 
different animal species. Hence the extent of substitutions in 
the latter (variable) residues can provide a preliminary 
indication of how similar is the binding capacity of the 
different animal receptors compared to human. Neighbouring 
highly variable residues at the binding interface may, however, 
such as E35, D38 and G354 although are not in direct contact 
with the viral RBD in the human receptor, may have 
substitutions that would enhance or interfere with binding in 


other species. These findings confirm the previous notion that 
considered that all residues at or near the binding interface 
would form a better indicator for the likelihood of viral RBD 
binding with a specific host. 

Based on the latter arguments, and for the purpose of finding 
a simple but quantitative indicator of virus entry and 
susceptibility based on comparative sequence analysis with the 
human sequence, we constructed a phylogenetic tree (Figure 6) 
using the alignment blocks of all the residues at the binding 
interface between the ACE2 and the viral RBD. Residues 19-90 
and 324-393 (shown in orange color in Figure 6A) were selected 
for this purpose. 
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Chicken 


Chicken 


Ferret 


FIGURE 7 | Isosurface representations and electrostatic (Coulombic) maps of the predicted ACE2 receptor structures for various animal hosts. The figure shows 
the isosurface representations of the predicted structures of the ACE2 (A) and the corresponding Coulombic coloring of the surface representations (B). The region 
that is likely to affect the binding (residues 19-90 and 324-393) is depicted in distinct colors whereas the rest of the PD domain is depicted in dim grey. The viral 
receptor is hidden to show the overall shape and surface charge of the binding surface on the ACE2 receptor. For clarity, the human receptor-RBD complex is 
shown on the left of both atomic and its isosurface representations (panels A and B, respectively), in the same orientation as of the other models. In the latter, 

the RBD is in blue, the region having residues 19-90 and 324-398 is depicted in orange and the remainder of the PD domain is in dim grey. In panel B, the positively 


charged, negatively charged, and hydrophobic (neutral) regions are depicted in blue, red and white, respectively. 


The phylogenetic analysis and identity matrix of all the 
aligned residues at the binding interface between the ACE2 
and the viral RBD revealed that this region in humans, gorillas 
and chimpanzees are closely related to each other and are 
identical (100%). The latter region identity between humans 
and cats (83.8%) is higher than that of humans and dogs (79.5%). 
While the identity is 78.1% in ermines (minks) and 77.4% in 
ferrets, it is quite low in chickens (64.7%), as presented in 
Figure 6B and Supplementary Table S5. 

To visualize and investigate the impact of mutations at the 
binding interface, homology models were built for the receptor- 
RBD complex, by fixing the sequence of the viral RBD and 


varying the receptor sequence according to the species 
(Figure 7). Models were built using the PDB IDs 6LZG (Wang 
et al., 2020) and 6M17 (Yan et al., 2020) as templates. Figure 7A 
shows the isosurface representations of the obtained models 
(Supplementary Figure S2) for the receptor PD of the selected 
species, namely human, cat, dog, ferret, ermine and chicken. The 
binding interface is colored in different colors, whereas the 
remaining part of the receptor domain is colored in dim gray. 
An analysis of these models revealed that, on the one hand, the 
overall shape and structure of the binding interface region of the 
cat and the dog are more similar to that of the human when 
compared with their counter regions in the ferret and the 
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chicken. These larger variations in the latter (less susceptible) 
species may introduce stearic effects that may reduce or hinder 
the binding. On the other hand, the electrostatic (Coulombic) 
colored isosurfaces (Figure 7B) show the appearance of several 
hydrophobic regions within the binding region (shown in white), 
as well as an increase in the positive charges (shown in blue), in 
the cat and the dog compared with that of the human receptor. 
The latter positively charged regions appear to be more in the 
dog than in the cat. In contrast, the surface of the ferret and the 
chicken receptors appear to be more negatively charged. Similar 
conclusions can be drawn on the analysis of polar contacts and 
potential hydrogen bonding in the same models, i-e., between the 
RBD and receptors from the different species (Supplementary 
Figure §2). It is clear that the number of polar contacts in cats 
and dogs are less than in humans. Out of the studied species, the 
chicken has the least contacts. The ermine and the ferret had a 
different distribution of predicted hydrogen bonds when 
compared with humans. 


DISCUSSION 


Coronaviruses infect a wide variety of hosts (Shafique et al., 
2020). In the present study, we aimed to investigate the 
mutations of the SARS-CoV-2 spike glycoprotein detected in 
cats and their effect on its structure and function by sequencing 
the full spike gene, followed by bioinformatics analysis. 
Furthermore, we highlighted the effect of residues substitution 
on viral infectivity, evolution and evasion from immune 
response. We found that cats with a history of direct contact 
with COVID-19-positive owners were also infected with 
COVID-19, revealing the possibility of SARS-CoV-2 
transmission between humans and cats. 

Generally, the high substitutions and mutations in 
coronaviruses revealed that the virus is adapting to new hosts 
(Duffy, 2018). The significance of amino acid substitution 
depends on the switching of properties and classes; for 
instance, a spike glycoprotein switching from leucine to 
methionine in L452M may have no significance as both are 
hydrophobic and neutral. Similarly, switching from tyrosine to 
phenylalanine in the Y453F mutation may have no significance 
as both are aromatic and hydrophobic. In contrast, substitution 
from glutamine to histidine in the Q498H mutation could 
significantly change the protein and side chain structure 
because they have different features, as histidine has a positive 
charge while glutamine has a neutral charge (Elaswad et al., 
2020). Therefore, significant changes in a spike glycoprotein 
could change its viral infectivity, pathogenicity and virulence, 
and could increase cross-species transmission (Wan et al., 2020). 

In the present study, SARS-CoV-2 incidence in cats was 
detected from four clinical samples, and was consistent with 
molecular results reported in previous studies (Newman et al., 
2020). Furthermore, previous experimental studies have shown 
the clinical susceptibility of cats to SARS-CoV-2 (Hamer et al., 
2021), which may be due to the similarity in ACE2 receptors 
between feline species and humans (Sailleau et al., 2020; 


Patterson et al., 2020; Hamer et al., 2021; Ruiz-Arrondo et al., 
2021). Moreover, cats shed SARS-CoV-2 horizontally after 
infection for up to 6 days (Hamer et al., 2021), and the virus 
can persist in cats post-infection for up to 32 days (Bosco-Lauth 
et al., 2020; Shi et al., 2020). The high positive incidence in our 
study can be attributable to the fact that all tested cat samples 
came from homes with COVID-19-positive human cases. 

SARS-CoV-2 transmission between humans and cats was 
reported in this study as the results of the phylogenetic 
analysis revealed a similarity between SARS-CoV-2 detected in 
cats (OK144251, OK144252 and OK144253 to OK144255) and 
in human cases reported in the same months (EPI_ISL1315064, 
EPI_ISL1936365 and EPI_ISL3274157), as presented in Figure 3 
and Supplementary Table $4. Additionally, the veterinarian 
who was in contact with the infected cats was asymptomatic; 
however, he tested positive for COVID-19 and his sequence 
result was identical to that of the young cat and her mother. 
Moreover, sequence similarity between the SARS-CoV-2 spike 
gene detected in the young cat and her mother confirms that the 
virus is the same. Thus, there is a potential support of SARS- 
CoV-2 transmission between humans and cats, but further 
studies are needed to investigate the direction of the viral 
transmission as the study’s design has not determined the 
infection direction whether animal-to-human or human-to- 
animal or animal-to-animal viral infection occurred. 

The present study revealed changes in the spike glycoproteins 
and the unique amino acid substitutions of the cat samples in 
correspondence with the reference Wuhan isolate, as illustrated 
in Table 1, Figure 4 and Supplementary Figure S1. Seven 
critical amino acid substitutions were observed. The diverse 
and frequent single residue mutations that emerged 
independently and repeatedly suggests their possible adaptive 
role (Kubik et al., 2021). Different spike regions are subjected to 
distinct evolutionary changes as mutations at receptor-binding 
domains (RBDs) result in both changes in the binding affinity to 
ACE2 receptors as well as in viral evasion from neutralizing 
antibodies, whereas mutations in N-terminal domains (NTDs) 
are mostly related to the potential escape from neutralizing 
antibodies (Kubik et al., 2021). Interestingly, some of these 
mutations are unique to the Wuhan-Hu-1 reference isolate and 
are reported in cat samples for the first time in this study, as they 
were not previously detected in cat samples in Hong Kong, 
France, Italy or the USA. The latter mutations are (W152R), 
(A570V), (Q677H), (A899S) and (S1051Y). In this study, we 
focused on the spike gene as it the most variable part of the viral 
genome and it plays crucial role in the receptor binding. It is also 
the protein that carries the receptor binding domain (RBD) 
which is the main point of entry to the host (Fung and Liu, 2019). 
The spike is also the main protein that exhibit mutations due to 
immune escape and generally have a major immunological role 
(Harvey et al., 2021). 

Structural visualization and homology models revealed that 
four residue substitutions were located on the viral surface, 
which may have appeared as viral antigenic evolution to evade 
neutralizing antibodies. These mutations, (W152R), (L452R), 
(A570V) and (A899S), have weakened critical interaction 
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points with the neutralizing antibodies and are associated with 
low sensitivity to mAbs and vaccine sera (Kubik et al., 2021). The 
observed (W152R) mutation is critical and is frequently observed 
across geographical and phylogenetic contexts; thus, it is a 
hotspot for mutations (Kubik et al., 2021). The substitution of 
tryptophan to leucine (W152L) or arginine (W152R) is the most 
common mutation at this location based on the SOPHiA DDM 
database. In addition, the substitution of tryptophan to cysteine 
(W152C) or glycine (W152G) is rare (Sophiagenetics, 2021). 
During the second wave of the SARS-CoV-2 pandemic, between 
December 2020 and April 2021, the (W152R) mutation 
increased steadily and rapidly, with one of the most notable 
adaptive mutations (N501Y) in the receptor-binding domain 
(RBD) coexisting (Kubik et al., 2021). Similarly, (W152R) 
substitution affects viral evasion from neutralizing antibodies 
and escape from immunity, as demonstrated in structural studies 
in which tryptophan (W) removal has weakened critical 
interaction points with antibodies (Kubik et al., 2021). In 
particular, a residue in the antibody chain interacts with W152 
in a hydrophobic pi-stacking interaction, wrapping W152 inside 
one of the complementarity-determining region (CDR) loops in 
the variable chains of antibodies (Kubik et al., 2021). Indeed, 
W152 is a key position that contributes to the latter interaction. 
As a result, changes in W152 impair not only the pi-stacking 
interaction, but also the pocket created by the antibody’s CDR 
loop, resulting in a significant reduction in binding affinity 
(Kubik et al., 2021). Furthermore, (W152R) substitution 
suggests viral evolution to maintain effective binding to the 
ACE2 receptor (Kubik et al., 2021). Therefore, monitoring the 
individual mutations that occur outside the receptor-binding 
domain (RBD) and inside the N-terminal domain (NTD) 
is critical. 

The second mutation observed in this study was (L452R) in 
the receptor-binding motif (RBM), as illustrated in 
Supplementary Figures $2B, C. This mutation was first 
reported in January 2021 in California, USA in the 20C/S:452R 
variant, and its frequency has increased over time (Zhang W 
et al., 2021). Since the first observation, novel variants of SARS- 
CoV-2, including this amino acid substitution, have been seen in 
the Delta/21A/B.1.617.2, Kappa/21B/B.1.617.1 and Epsilon/21C/ 
B.1.427.9 viruses with increased virulence and infectivity 
(Lauring and Hodcroft, 2021). Additionally, in this mutation, a 
factor of a 3- to 6-fold reduction in antibody neutralization titers 
in vaccinated individuals was observed (McCallum et al., 2021). 
Furthermore, this substitution reduced the neutralizing activity 
of monoclonal antibodies against RBD by 14- to 35-fold 
(McCallum et al., 2021). Moreover, a 2-fold increase in viral 
shedding was reported in the involved variants (Deng et al., 
2021). It was also found that (L452R) could strengthen ACE2 
receptor-binding affinity (Chen et al., 2020) and increase viral 
infectivity (Liu et al., 2021). A mutation in the C-terminal 
domain 1 (CTD1) at position 570 was also observed. In 
addition, (A570V) was detected in North America and Asia 
(Guruprasad, 2021). Recently, biophysical and Cryo-EM studies 
demonstrated that some residue substitutions in the spike have 
affected the conformational dynamic equilibrium between the 


open RBD-up and closed RBD-down forms of the spike 
(Henderson et al., 2020). In particular, the authors found that 
certain residue substitutions in the regions of contact between 
domains could affect the equilibrium between the open and 
closed states. For example, the combined mutations in the A570, 
1572, F855 and N856 positions of SD1 and S2 inter-protomer 
regions have led to the dominance of the open RBD-up receptor- 
accessible form of the spike. Therefore, these structural 
rearrangements have resulted in the enhancement of the 
binding affinity of the spike to the ACE2 host receptor 
(Henderson et al., 2020). More specifically, the substitution of 
A570 with a more hydrophobic one, such as L570, has aided in 
such an equilibrium shift. Hence, the hydrophobic (A570V) 
mutation (Supplementary Figure $1) observed in our samples 
may contribute to a similar equilibrium shift. The fourth 
mutation on the surface was (A899S) in the fusion-peptide 
proximal region (FPPR). This mutation was recently observed 
in Jordan (Alanagreh et al., 2021), Saudi Arabia (Alghoribi et al., 
2021) and Italy (Castelli et al., 2021). This substitution is 
recorded in lineage C.36 and is associated with high 
transmissibility and low sensitivity to mAbs and vaccine sera 
(Castelli et al., 2021). 

The fifth mutation in our study (D614G) was found 
downstream receptor-binding domain in the C-terminal 
domains 2 (CTD2). Such a mutation became dominant during 
the emergence of the COVID-19 pandemic (Korber et al., 2020a; 
Yurkovetskiy et al., 2020). It is not only able to enhance the viral/ 
host interaction on the cellular level (Ortega et al., 2020), but it is 
also involved in the upward increase in SARS-CoV-2 
pathogenicity, increasing viral transmission and evasion from 
immune response and vaccine (Korber et al., 2020b). This 
mutation appears to enhance the spread of SARS-CoV-2 and 
increases the viral infectivity that resulted from increasing the 
stability of the spike trimer. Certainly, in such a mutation, there 
is loop misplacement, structural rearrangements and interaction 
between downstream receptor-binding domains, which act as 
wedges within the spike trimer that strengthen the stability and 
prevent pre-maturation of the spike, and improve the strong 
assembly of the virus, resulting in the enhancement of the viral 
binding with the ACE2 receptor, as presented in Figure 4. This 
could facilitate viral entry, infectivity and virulence. It is 
unsurprising that all newly emerged variants contain this 
mutation and that they are more infectious than the Wuhan 
reference virus (Zhang J et al., 2021). The (D614G) substitution 
was reported to improve viral replication in human lung 
epithelial cells and primary human airway tissues by 
improving viral infectivity and stability (Plante et al., 2021). 
Furthermore, this mutation lowered the effectiveness of COVID- 
19 vaccinations in clinical studies (Plante et al., 2021). 

The (Q677H) mutation in the $1/S2 cleavage site was detected 
in both the two cat samples and the veterinarian sample in this 
study. The (Q677H) mutation was firstly identified in clade 20C 
in the USA (Pater et al., 2021). It was also detected in the (Eta/ 
21D/B.1.525) variant (Hodcroft et al., 2021). As the position of 
such a mutation is found in a critical polybasic region of the 
furin-binding site and it could influence $1/S2 cleavage, SARS- 
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CoV-2 infectivity and pathogenicity could be subsequently 
affected (Hodcroft et al., 2021). The seventh mutation observed 
in this study is (S1051Y) in the B-hairpin region of subunit 2 
(S2). The three-dimensional organization of HR1 (residues 910 
—985), CH regions (residues 986-1035), B-hairpin (1035-1068) 
and UH (residues 736-781) is a significant feature of the spike 
glycoprotein structure (Walls et al., 2016). Structural modeling 
and analysis suggest that the (S1051Y) mutation (Figures 4C-E) 
structure. Indeed, this mutation could play a role in viral entry 
during the replication cycle. The latter hypothesis requires 
further experimental investigation. 

Despite the multiple mutations found in the sequences 
presented in this study, all these mutations had been 
previously reported in humans. Hence, although some of the 
latter mutations may be important for the virus strain survival 
within cats, these mutations are not unique to this particular 
host, and it is likely that a cross-infection between humans and 
cats has occurred. These findings raise several questions, 
including: What factors control the SARS-CoV-2 host ranges? 
Do variations in the host receptor play a role in the susceptibility 
of the host to SARS-CoV-2 infection? Can we obtain insights 
into the susceptibility of a species by comparing the species 
receptor with the human receptor? To answer these questions, 
we analyzed the ACE2 receptors from various animal species 
using sequence and structural bioinformatics tools. 

Obviously, many host factors, may play a crucial role in 
susceptibility to SARS-CoV-2 infection. One of the possible 
factors likely to influence the virus host range is the variation in 
the viral receptor itself from one species to another. In this work, 
we showed that simple sequence comparisons (Table 2 and 
Figure 5), phylogenetic analysis (Figure 6) and the structural 
modeling (Figure 7) of residues at the binding interface between 
ACE2 receptors from different hosts and the viral RBD may be 
used as an initial effective indicator of the susceptibility of the host 
to SARS-CoV-2 cross-infection. More specifically, the amino acid 
composition of this region (residues 19-90 and 324-393) may 
provide an indication of the binding capacity of the virus to the 
specific species host cell, and hence a determinant for the 
susceptibility of the host. This study, therefore, reveals that a 
simple phylogenetic analysis and identity matrix of the aligned 
residues could provide a quantitative measure of the likelihood of 
virus entry into the cells of various hosts. 

The phylogenetic analysis and identity matrix of the aligned 
residues at the binding interface between the ACE2 and the viral 
RBD (Figure 6B and Supplementary Table S5) revealed that 
SARS-CoV-2 has the ability to interact with a broad range of 
ACE2 cell receptors in different species, which could increase its 
virulence and could enhance its inter-species transmission (Su 
et al., 2016) (Bolles et al., 2011). The sequence identity of the 
latter region between humans and cats (83.8%) is higher than 
that between humans and dogs (79.5%), which suggests that the 
potential recognition of the RBD-spike glycoprotein in cat ACE 
receptors is stronger than that in dogs, and these results are in 
accordance with a previous study in China (Luan et al., 2020). 
This high sequence identity in cats compared to human could 
explain the relatively high SARS-CoV-2 infection rate in the 
feline species (Hamer et al., 2021). Hence, regular monitoring of 


companion animals for SARS-CoV-2 infection is highly 
recommended. The similarity percentage in ermines (minks) 
(78.1%) is lower than that in cats and dogs, however the reported 
infection rate of SARS-CoV-2 in minks is high, according to the 
fifth report of the OIE (2021). This may be an indicator that there 
are other effective factors in minks which make this species more 
susceptible to SARS-CoV-2 than others, and these factors need 
further investigation. The sequence identity was 74.6% in greater 
horseshoe bats and 78.8% in Chinese pangolins. Previous studies 
have suggested that SARS-CoV-2 from bat CoV RaTG13 and 
pangolin isolates, have sequence identities of 96.3% and 89%, 
respectively. These studies concluded that, bats could be the 
natural origin of SARS-CoV-2 (Bosco-Lauth et al., 2020; Shi 
et al., 2020), and although pangolins cannot transmit SARS- 
CoV-2 directly due to the percentage of divergence, they may 
play an intermediate role (Calvet et al., 2021). Indeed, there is a 
need for further investigation to discover other factors that may 
point to bats being the potential natural host for SARS-CoV-2, 
rather than the receptor homology and binding affinity. Overall, 
the presented work paves the way for binding affinity predictions 
and free energy calculations through molecular dynamics 
studies, and opens the gates for further investigation into 
species susceptibility to SARS-CoV-2 infection. 


CONCLUSIONS 


This study displays critical mutations in the spike glycoprotein of 
SARS-CoV-2 detected in cats and in their treating veterinarian, 
through full spike gene sequencing, structural visualization and 
homology modeling. The analysis of the detected mutations 
revealed potential evidence for spillover of SARS-CoV-2 
between humans and cats. Furthermore, the reported 
mutations could affect the viral evasion from immune 
response, the viral infectivity and pathogenicity. Hence, regular 
complete genome sequencing of SARS-CoV-2 in humans and 
nearby companion animals, as well as studying the 
recombination events in such viruses, are recommended in 
order to track newly emerging mutations and investigate their 
effects on viral adaptation, transmission, and their impact on 
therapeutics and vaccine design. 
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